Neural signatures of automatic repetition detection in temporally regular and jittered acoustic sequences

Detection of repeating patterns within continuous sound streams is crucial for efficient auditory perception. Previous studies demonstrated a remarkable sensitivity of the human auditory system to periodic repetitions in unfamiliar, meaningless sounds. Automatic repetition detection was reflected in different EEG markers, including sustained activity, neural synchronisation, and event-related responses to pattern occurrences. The current study investigated how listeners’ attention and the temporal regularity of a sound modulate repetition perception, and how this influence is reflected in different EEG markers that were previously suggested to subserve dissociable functions. We reanalysed data of a previous study in which listeners were presented with sequences of unfamiliar artificial sounds that either contained repetitions of a certain sound segment or not. Repeating patterns occurred either regularly or with a temporal jitter within the sequences, and participants’ attention was directed either towards the pattern repetitions or away from the auditory stimulation. Across both regular and jittered sequences during both attention and in-attention, pattern repetitions led to increased sustained activity throughout the sequence, evoked a characteristic positivity-negativity complex in the event-related potential, and enhanced inter-trial phase coherence of low-frequency oscillatory activity time-locked to repeating pattern onsets. While regularity only had a minor (if any) influence, attention significantly strengthened pattern repetition perception, which was consistently reflected in all three EEG markers. These findings suggest that the detection of pattern repetitions within continuous sounds relies on a flexible mechanism that is robust against in-attention and temporal irregularity, both of which typically occur in naturalistic listening situations. Yet, attention to the auditory input can enhance processing of repeating patterns and improve repetition detection.


Introduction
Detection of repeating patterns is crucial for efficient perception of sounds that continuously unfold in time [1,2].Especially in complex listening situations that involve several simultaneously active sound sources, recognition of familiar sound patterns facilitates the segregation of sound streams and enables rapid adaptive reactions to change in the environment [3][4][5][6][7].There is compelling evidence that the human auditory system is exceptionally sensitive to pattern repetitions in sounds, even when the acoustic signal contains only minimal spectro-temporal structure such as in the case of (periodic) white noise [8][9][10][11][12][13].
A growing number of studies has moved beyond using strictly isochronous pattern repetitions and asking participants to complete an active repetition detection task.In fact, any mechanism that can possibly support pattern repetition detection in real-life listening situations should be somewhat tolerant to listeners' in-attention and temporal irregularity with regard to pattern occurrence in the stimulus stream.Several studies showed that this is indeed the case: A negativity in the ERP was elicited relative to the onset a repeating pattern in white noise not only when participants' attention was focussed on the auditory stimuli, but also when they were presented with the noise sequences while reading a book [33], performing a visual distractor task [15], and even during sleep [16].Similarly, pattern repetitions in white noise and sequences of tone pips led to an increase in ITPC while participants were asleep [16] or focussed on a concurrent visual task [25].A repetition-related increase in sustained response magnitude to sequences of tone pips in the absence of listeners' attention to the auditory stimulation was reported by some studies [22,[27][28][29], but not by others [25].Only one study investigated the role of temporal regularity for the detection of pattern repetitions in tone pip sequences: In this study [26], the authors reported a negativity time-locked to repeating pattern onsets that was elicited consistently across temporally regular and jittered sequences, whereas the earlier positivity occurred only in regular sequences.They therefore argued that, while the negative component is related to the repetition of a specific pattern (irrespective of temporal regularity), the additional positive component in regular, temporally predictable sequences reflects neural entrainment to the periodic stimulus rhythm and anticipation of upcoming pattern occurrences [26].
Taken together, neither attention nor temporal regularity appears to be indispensable for the successful detection of repeating patterns in continuous sounds.However, since earlier studies only focussed on either of the two factors and not always directly compared different levels of attention or regularity, less is known about the interaction between attention and regularity and about whether they substantially modulate repetition perception.For instance, it remains unclear whether irregular repetitions could also be detected in the absence of attention, and whether attention and regularity improve (or in-attention and irregularity impair) the detection of pattern repetitions.Moreover, previous findings revealed some discrepancy with regard to the influence of attention on different repetition-related EEG markers (often analysed only in separate studies).One study analysed both sustained activity and ITPC within the same dataset and found that temporal regularity of a sound led to an increase of ITPC irrespective of the listeners' attentional state, while an increase in sustained activity was only observed during attention (but not during in-attention; [25]).Therefore, the authors argued that the two markers might reflect functionally dissociable stages of repetition perception [25].The current study aims to systematically assess in a two-by-two design how attention and temporal regularity (as well as their interaction) shape pattern repetition perception and influence its different neural signatures (within the same dataset).To this end, we presented listeners with sequences of correlated noise that contained (or did not contain) repetitions of a certain sound segment, with repetitions occurring either in a temporally regular or jittered manner, while attention was directed either towards the pattern repetitions or away from the auditory stimulation.We analysed three different EEG markers, all of which were previously related to successful repetition detection in different studies, but not systematically compared within the same dataset: global field power (GFP) as a measure of sustained activity throughout the sequence, ERPs time-locked to repeating pattern onsets, and ITPC.That way, we might be able to reconcile previous, partly discrepant, findings on the role of attention and regularity and provide a more comprehensive view on auditory repetition perception and its neural correlates.

Materials and methods
The present study is a reanalysis of a dataset that was previously used to explore a different research question, namely the formation of memories for recurring sound patterns across trials [34].Conversely, the current analysis investigates the perception of pattern repetitions within sounds.

Participants
29 participants (26 female, three male), aged 18 to 32 years (M = 21.38 years, SD = 3.21 years), took part in the study.None of them reported impaired hearing or a history of any neurological or psychiatric disorder, and all of them had normal or corrected-to-normal vision.Participants were recruited at Leipzig University between April and July 2022.All participants were naïve regarding the purpose of the study, gave written informed consent before the start of the experiment, and received course credits for their participation.Consent forms were stored separately from the experimental data, and any personal data were pseudonymised, such that after data collection individual participants could not be identified.We obtained written approval by a local ethics committee (Ethics Advisory Board at Leipzig University; reference number: 2022.01.26_eb_128) prior to the study, and all experimental procedures were in accordance with the Declaration of Helsinki.

Stimuli
We used sequences of correlated noise as auditory stimulus material.Correlated noise was described in detail by McDermott and colleagues [5] and refers to randomly generated white noise sequences that were transformed using a generative model to match statistical properties of natural sounds.This type of stimulus material offers the advantage that the stimuli resemble natural sounds more closely than previously used artificially generated stimuli (e.g., white noise) while retaining good acoustic control.Stimulus sequences were created using the Gaussian Sound Synthesis Toolbox (http://mcdermottlab.mit.edu/Gaussian_Sound_Code_for_Distribution_v1.1) in Matlab (version R2021a; The MathWorks Inc., USA), with a duration of 3500 ms, including 5-ms onset and offset ramps (half-Hanning windows).Transformation of the white noise sequences resulted in correlated noise sequences with a correlative structure, i.e., adjacent sampling points along the temporal and spectral dimension were correlated with regard to their spectral energy values, and the strength of this correlation decreased with increasing distance.Decay constants were the same as in the original study (-0.065 per 20 ms and -0.075 per 0.196 octaves), such that the structure of the generated stimuli matches the correlative structure of natural sounds [5].
We created sequences of random correlated noise without repetitions ("no repetition"; norep) and sequences that contained repetitions ("repetition"; rep).In rep sequences, a certain 200-ms segment occurred in total six times throughout the sequence.Rep sequences were created by inserting a separately generated 200-ms sound pattern into the 3500-ms sequence.For half of the rep sequences within an experimental block the same repeating 200-ms pattern was used, whereas for the other half a new pattern was created for each sequence.Pattern recurrence across trials was unrelated to the present research question and only relevant for our previous investigation of memory formation for specific recurring patterns across multiple trials, which was performed on the same dataset (for details, please refer to our publication of this previous study; [34]).As the procedure of inserting short patterns into the longer continuous sequence resulted in local disruptions in the correlative structure of the sound at pattern boundaries, we controlled for these local changes by inserting six (different) 200-ms segments into no-rep sequences.Cross-fading (using 5-ms half-Hanning windows centred 2.5 ms relative to the beginning and -2.5 ms relative to the end of an inserted 200-ms patterns) was used to avoid audible artefacts due to abrupt changes in the spectrum at segment boundaries.In all sequences, the time point of the first pattern onset was selected randomly between 50 and 500 ms relative to sound onset.The following pattern onsets occurred either with a constant interval of 300 ms (regular) or variable intervals between 50 and 550 ms (jittered) between patterns.In jittered sequences, intervals between patterns were chosen randomly, with the restriction that the duration of two consecutive inter-pattern intervals must differ by at least 50 ms.Stimulus sequences are illustrated in Fig 1 (panel A), and audio examples can be found in the online supplemental material (https://osf.io/xn9t4/).

Procedure
Participants completed two EEG sessions on separate days (with on average 13 days in between).In the first session, listeners' attention was directed away from the auditory sequences (no-attention), which they were instructed to ignore while performing a visual distractor task that required continuous monitoring of the visual stimulation.In the second session, their attention was directed towards the auditory sequences (attention) by a repetition detection task, which required them to indicate in each trial whether the sequence contained a repetition.The fixed session order served to avoid active knowledge about the repetitions in the auditory sequences during the no-attention session after participants performed the auditory repetition detection task in the session before.In each session, they completed five blocks with regular and five blocks with jittered sequences in a random order, with breaks between blocks as required.Each block consisted of 60 randomly ordered auditory sequences, 50% of which were rep and no-rep sequences, respectively.In 50% of the rep sequences per block, the repeating pattern was the same across trials within the block, whereas the remaining rep sequences contained a repeating pattern that occurred in only one trial throughout the experiment.Between two consecutive sequences, silent intervals ranged between 2175 and 2625 ms in duration (in steps of 50 ms).The experimental design is illustrated in Fig 1 (panel  B).
The visual display in the no-attention session consisted of eight squared dark-grey frames (width/height: 0.50˚visual angle) arranged in a circle (radius: 2.11˚visual angle) on a grey background at equal distance from a white fixation cross.In each of the 240 visual trials per block, a white square appeared at one of the eight frame positions for 150 ms.Participants were asked to fixate the cross in the centre of the screen and press a button a quickly as possible whenever the white square appeared at the same frame position as two trials before.The first five trials of each block were always non-targets, and 2-back targets occurred randomly in 10% of the trials, each of which was followed by at least two non-targets.While square positions were chosen randomly for non-target trials, targets occurred equally often at each position.The visual stimulus onset asynchrony ranged between 1425 and 1575 ms (in steps of 10 ms), and visual stimulation had no temporal relationship with the auditory stimulation.Auditory stimulation began five seconds after the visual stimulation at the beginning of each block.At the beginning of the session, participants completed a short training block without concurrent auditory stimulation, during which they received feedback about the correctness of their response in each trial.During the actual experiment, feedback (hit/false alarm rates and mean reaction time) was provided only at the end of a block.
At the beginning of the attention session, the different types of auditory sequences were introduced to the participants.An example sequence (which was not used during the actual experiment) was provided for sequences with "regular repetitions" (rep, regular), "irregular repetitions" (rep, jittered) or "no repetitions" (no-rep) and could be repeated as often as listeners wanted.They were informed that repetitions occurred in 50% of the trials and that regular and irregular sequences occurred in separate blocks.A white fixation cross on a grey background was displayed during sound presentation, followed by the response options ("repetition"/"no repetition") during the response interval (until a response was given or a maximum of 2000 ms expired).Participants pressed either the left or the right button (counterbalanced across participants) on a response time box with their left or right index finger, respectively.Feedback (percentage of correct responses) was again provided at the end of a block.
Participants were seated inside an acoustically and electrically shielded chamber during the experiment.Task instructions and visual stimuli were presented on a computer screen located at approximately 80 cm distance from the participants' eyes.Auditory stimuli were delivered binaurally via headphones (Sennheiser HD-25-1, Sennheiser GmbH & Co. KG, Germany).Stimulus presentation and response registration was controlled using the Psychophysics Toolbox extension (PTB-3; [35,36] in GNU Octave (version 5.2.0), and behavioural responses were recorded with a response time box (Suzhou Litong Electronic Co., China).

EEG data acquisition
We recorded the continuous EEG from 64 Ag/AgCl electrodes mounted in an elastic cap according to the extended 10-20 system.To record horizontal and vertical eye movements, additional electrodes were placed on the outer canthus of both eyes and above and below the right eye.Signals were also recorded from the left (M1) and right (M2) mastoid and from and electrode placed on the tip of the nose, which served for offline referencing.Offsets of all electrodes were kept below 30 μV.Signals were referenced to the CMS-DRL ground, amplified with a BioSemi ActiveTwo amplifier (BioSemi B.V., Amsterdam, The Netherlands), and digitised with a sampling rate of 512 Hz.

Data analysis and statistical inference
Since the focus of the current study was the perception of pattern repetitions within a sound (and not the effect of pattern recurrence across trials as in the previous study; [34]), all sequences with pattern repetitions were collapsed into the same condition (rep) for the present analysis.To make sure that the repetition of patterns across sequences did not bias the current results, the analysis was repeated analogously excluding sequences that contained repetitions of patterns that reoccurred across trials.This approach yielded a virtually identical pattern of results, thus we decided to include all sequences for the sake of statistical power.

Behavioural data
Analysis of behavioural data was done in RStudio (version 4.0.2,RStudio Inc., USA).Performance in the repetition detection task in the attention session was analysed within the framework of signal detection theory [37].Trials were classified as hits when participants correctly indicated that a rep sequence contained repetitions and as false alarms when they erroneously indicated that a no-rep sequence contained repetitions.We then computed the d' sensitivity index from hit and false alarm rates separately for regular and jittered blocks, applying a loglinear transformation [38] to avoid infinite values.To statistically test whether there was a difference in repetition detection performance between regular and jittered blocks, we compared d' scores using a two-sided paired t-test, with the standard .05alpha criterion to define statistical significance.Bayesian tests were computed, using the package "BayesFactor" [39,40], and Bayes Factors (BF 10 ), interpretable as the posterior probability of the null (H 0 ) and alternative hypothesis (H 1 ) given the data, are reported in addition to the frequentist statistics.BF 10 > 3 (10) was considered moderate (strong) evidence for the alternative hypothesis and BF 10 < 0.33 (0.1) was considered moderate (strong) evidence for the null hypothesis, in accordance with widely used conventions [41], and values in between were considered inconclusive.
Pre-processing.Pre-processing of EEG data was done separately for each of the two sessions per participant.After re-referencing the data to the channel on the tip of the nose, noisy channels were identified if their signal variance exceeded an absolute z-score of 3.0.These channels were excluded from pre-processing and later spherically spline interpolated.The remaining data where then high-pass and low-pass filtered using Kaiser-windowed sinc finite impulse response (FIR) filters.The cut-off for the low-pass filter was 35 Hz (transition bandwidth: 5 Hz, maximum passband deviation: 0.001, filter order: 372), while high-pass filters with different cut-offs were applied for the three EEG markers that we analysed (see below).After filtering, the continuous data were epoched from -100 to 4000 ms relative to sequence onset.To remove physiological and technical artefacts, an independent component analysis (ICA) was used, computed on a copy of the data filtered with a 1-Hz high-pass filter (transition bandwidth: 0.5 Hz, maximum passband deviation: 0.001, filter order: 3710) to improve signal-to-noise ratio for the decomposition.Before ICA decomposition, epochs with a peak-to-peak difference exceeding 750 μV were discarded and data were down-sampled to 128 Hz.ICA weights, obtained with an infomax algorithm implemented in the runica function in EEGLAB, were transferred to the dataset with the final pre-processing parameters.Classification of independent components was done automatically using the IC Label plugin [44], and all components classified as eye blinks, muscle or cardiac activity, line or channel noise were removed.Any auditory event within 500 ms before and after a button press or within 500 ms after a visual target in the no-attention session was excluded from the analysis to minimise the influence of motor and visual activity on auditory EEG responses.
Sustained response: Global field power (GFP).For the analysis of sustained activity, data were high-pass filtered (during pre-processing) with a low cut-off at 0.1 Hz (transition bandwidth: 0.2 Hz, maximum passband deviation: 0.001, filter order: 9274) to avoid filtering out slow potential shifts that are characteristic for the sustained response.From the pre-processed data, we extracted epochs that ranged from -100 to 3000 ms relative to the onset of the first pattern per sequence and baseline-corrected them to the 100-ms interval prior to first pattern onset.Sequence epochs were discarded if their peak-to-peak difference exceeded 300 μV, and the remaining epochs were re-referenced to the average of all channels.For each participant, averages were computed for rep and no-rep sequences in each of the four attention and regularity conditions.GFP at each sampling point was computed from these within-participant averages as the root mean square (RMS) of the signal across all scalp electrodes [45].
For statistical evaluation, mean GFP was extracted for each attention and regularity condition from a time window that ranged from 500 to 3000 ms relative to the first pattern onset, i.e., from the first pattern repetition to the end of the sequence.We used a three-way repeatedmeasures ANOVA (implemented in the R package "ez") with the factors Repetition (rep, norep), Attention (attention, no-attention), and Regularity (regular, jittered) to test whether GFP differed between sequences with and without sequences, and whether this effect is modulated by attention and regularity.Greenhouse-Geisser correction was applied whenever Mauchly's test indicated non-sphericity (p < .05).A corresponding Bayesian ANOVA [46] was again computed in addition to the frequentist ANOVA.Reported BF 10 's reflect the evidence for models that include the respective (main or interaction) effect relative to reduced matched models without the respective effect (in line with recent recommendations; [47]).A significant main effect of Repetition would indicate that the brain successfully picked up the pattern repetitions within sound sequences, and a significant interaction of Repetition with Attention or Regularity would indicate that the repetition effect is modulated by the respective factor.To further elucidate the nature of the modulation by Attention or Regularity, significant (p < .05)two-way interactions with Repetition were followed up using (both frequentist and Bayesian) paired t-tests.Specifically, we computed the rep vs. no-rep contrast separately for the two levels of the modulating factor (Attention or Regularity), and subsequently compared the repminus-no-rep difference between the two levels (i.e., attention vs. no-attention, or regular vs. jittered).
Event-related potential (ERP) responses to repeating pattern onsets.For the ERP analysis, data were filtered with a 1-Hz high-pass filter (transition bandwidth: 0.5 Hz, maximum passband deviation: 0.001, filter order: 3710) in order to filter out slow potentials.Extracted epochs ranged from -100 to 500 ms relative to single pattern onsets, averaged across the second to the sixth pattern occurrence per sequence.Pattern epochs were discarded if their peak-to-peak difference exceeded 150 μV, and no baseline correction was applied.After re-referencing to the algebraic mean of both mastoids, we computed first within-participant averages and then grand averages across participants for rep and no-rep sequences in each of the four attention and regularity conditions.
A non-parametric cluster-based permutation approach was used to determine time windows of interest for the statistical evaluation of mean ERP amplitudes.To identify clusters of significant differences in amplitude between rep and no-rep sequences at adjacent sampling points along both temporal and spatial dimension, we computed a cluster-based permutation test on rep vs. no-rep averages across the four attention and regularity conditions [48,49].Averaging across attention and regularity conditions before computing the cluster-based permutation test served to reduce the risk of biased analysis parameter choices [50].Both alpha level and cluster alpha were set to 0.05, and cluster-level significance probability was estimated using a Monte Carlo approximation with 1000 permutations.In the time range from 0 to 500 ms relative to pattern onset, the test revealed an early positive cluster, followed by a negative cluster, both of which were broadly distributed across fronto-centro-temporal electrodes.Significant time points were extracted for both of these clusters, resulting in two time windows of interest, the first one ranging from 0 to 160 ms and corresponding to an early positivity, and the second one ranging from 190 to 380 ms and corresponding to a subsequent negativity.
Mean amplitudes were extracted from these two time windows at a fronto-central cluster of nine electrodes (F1, F2, Fz, FC1, FC2, FCz, C1, C2, Cz).Statistical evaluation was done separately for the positivity and negativity, and followed the same procedures as described above for the sustained response.
Inter-trial phase coherence (ITPC).For the analysis of ITPC, data were high-pass filtered with a cut-off at 0.5 Hz (transition bandwidth: 0.5 Hz, maximum passband deviation: 0.001, filter order: 3710).Pre-processed data were epoched from -200 to 800 ms relative to single pattern onsets at the second to the sixth pattern occurrence per sequence.Pattern epochs were demeaned, and any epoch with a peak-to-peak difference that exceeded 150 μV was discarded.Signals were averaged within the same fronto-central electrode cluster as for the ERP analysis (see above), and 1500-ms zero-padding was applied at both ends of each epoch.We then used a convolution with Morlet wavelets to extract phase information from single epochs over a frequency range from 1 to 10 Hz (in steps of 0.2 Hz), with parameters of the wavelet linearly adjusted from three to seven wavelet cycles.ITPC between epochs was computed for each participant from the results of the wavelet convolution at each sampling point in the time-frequency space, separately for rep and no-rep sequences in each of the four attention and regularity conditions.We again used a cluster-based permutation approach to determine the time-frequency window of interest for statistical evaluation.After averaging across the four attention and regularity conditions, we computed a cluster-based permutation test (rep vs. norep), with an alpha level and cluster alpha of 0.001 (and again using a Monte Carlo approximation with 1000 permutations to estimate cluster-level significance probability).The test revealed a broad significant cluster that ranged from 0 to 500 ms relative to pattern onset and spanned a frequency range from 1 to 4 Hz.
We extracted mean ITPC from this time-frequency window of interest for subsequent statistical evaluation, which followed the same procedures as for the analysis of sustained response and ERPs to repeating pattern onsets.

Behavioural data
Participants detected pattern repetitions in the acoustic sequences on average above chance in both regular (M ± SD of d': 2.01 ± 0.97) and jittered (M ± SD of d': 1.84 ± 1.11) blocks.There was no significant difference between the two (t(28) = 1.92; p = .065;d = 0.36; BF 10 = 0.99), however Bayesian evidence was inconclusive.Thus, there might in fact be a trend towards better repetition detection performance in regular than in jittered sequences, though the effect of temporal regularity seems to be rather small.

Discussion
The current study set out to test whether and how listeners' attention and the temporal regularity of pattern occurrence within a continuous sound sequence modulate pattern repetition perception.We presented listeners with sequences of correlated noise that contained or did not contain repetitions of a certain sound segment.Pattern repetitions within a sequence were either temporally regular or jittered, and listeners' attention was either directed towards or away from the sounds during stimulus presentation.Besides behavioural repetition detection (when participants attended to the sounds), we measured repetition perception and its modulation by attention and regularity by means of three different EEG signatures: sustained activity throughout the full sequence (from repetition onset), ERPs and ITPC time-locked to repeating pattern onsets.
Overall, listeners were able to behaviourally detect repetitions well above chance (when they attended to the sounds), and successful repetition detection was reflected consistently in all three neural markers across attention and regularity conditions.Concretely, repetitions of a specific pattern within a continuous acoustic stimulus led to an increase in sustained activity from the first pattern repetition through the end of the sequence (for consistent previous results, see: [22,24,25,[27][28][29]), a characteristic positivity-negativity complex in the ERP timelocked to repeating pattern onsets [15,16,18,20,26,33], and enhanced ITPC of low-frequency (1-4 Hz) oscillations [15,16,19,20,25].It is plausible to assume that both our ERP and ITPC findings reflect a similar underlying effect, namely a phase alignment of neural activity relative to repeating pattern onsets.Consistent findings from both complementary approaches underline the robustness of the repetition effect and allow to compare our data to a number of previous studies that reported only either one or the other EEG marker.Notably, besides replicating findings of different earlier studies, we could demonstrate automatic detection of irregular, unpredictable pattern repetitions while listeners focussed on a demanding visual distractor task.Thus, we show that not only strict periodicities [15,16,22,25,[27][28][29]33], but also more irregular pattern repetitions within continuous auditory input are processed pre-attentively.This suggests that repetition detection does not rely on a merely temporal mechanism (i.e., the detection of an autocorrelation with a fixed time lag in the acoustic signal), but may involve enhanced sensitivity for previously encountered patterns (e.g., through rapid plastic changes), leading to amplified neural responses for pattern repetitions.
While pattern repetitions were detected automatically in both regular and jittered sequences during both attention and in-attention, repetition perception was substantially modulated by both factors.Our two-by-two within-subject design allowed to directly compare different levels of attention and regularity, and to show that an attentional focus onto the sounds substantially enhanced repetition perception.The repetition effect (i.e., the difference between sequences with and without pattern repetitions) was larger during attention than during in-attention to the auditory stimulation across all three neural markers.In contrast, earlier studies had suggested rather comparable magnitudes of the repetition effect between attention and no-attention as reflected in sustained activity [27], ERPs [15,16,33], and ITPC [25].However, most of these studies did either not compare attention conditions directly [15,16,22], used a between-subject design [27], or controlled attention less strictly [33].We argue that attention to the stimulus sequences (and, in particular, potential repetitions therein) enhances perceptual representation of the sound and thereby facilitates repetition detection.Sharpened short-term representations of the repeating pattern through attention may in turn boost robust memory formation for specific patterns that recur across multiple trials at a longer time scale (and potentially higher level of abstraction), which has been demonstrated previously [8, 13-17, 21, 23, 24, 30-32].Nevertheless, it should be noted that, despite the attention-related improvement, pattern repetitions are generally tracked automatically, also in the absence of attention.
Conversely, the influence of temporal regularity on repetition perception appeared somewhat less clear and consistent across different neural markers: While there was no difference in amplitude and morphology of the ERP to repeating pattern onsets between regular and jittered sequences, the repetition effect tended to be smaller for regular sequences in terms of sustained activity, but larger in terms of ITPC.The absence of a regularity-related difference in the ERP effect is only partly in agreement with the results of a previous study [26], which showed no difference in amplitude of the negative ERP component between regular and jittered pattern repetitions, whereas the early positive ERP component exclusively occurred in the regular condition.By contrast, the occurrence of both components across regular and jittered sequences in our data suggests that positivity and negativity do not subserve different functions (e.g., tracking of stimulus periodicity vs. detection of repeating pattern onsets), but rather that the positivity-negativity complex as a whole is related to pattern repetition detection.Nevertheless, the (descriptive) forward shift of the onset of the positivity for regular (compared to jittered) pattern onsets may indicate that anticipation of upcoming pattern occurrences in predictable sequences is reflected in the latency (but not in the magnitude) of the ERP response.If anticipation of upcoming pattern onsets indeed modulates the time course of the ERP such that the early positivity reaches into a time window before actual pattern onset, baseline correction could introduce amplitude differences between regular and irregular sequences by differentially shifting the whole positivity-negativity complex into a negative or positive direction (which may also explain discrepancies with regard to the presence of the early positivity in earlier studies, e.g., [15,26]).A similar interpretation may hold for the stronger ITPC effect we observed for regular than for jittered sequences: The strict periodicity in the stimulation allowed for (additional) entrainment of brain responses to the stimulus rhythm and for temporal prediction of the next pattern onset, which was not possible in unpredictable jittered sequences.Importantly, the presence of a significant repetition-related ITPC increase for jittered sequences suggests that the phase alignment of EEG responses cannot be explained merely by entrainment to the stimulus periodicity.Instead, synchronisation of neural responses relative to repeating pattern onsets occurred irrespective of their temporal regularity, possibly achieved via phase-reset of ongoing oscillations [19,51].Finally, there was a trend towards a larger repetition effect in sustained activity for jittered compared to regular sequences, which may seem counterintuitive at first glance.Especially in the attention condition, this effect seems to be driven by a GFP difference between regular and jittered sequences without pattern repetitions, whereas mean GFP was (descriptively) fairly similar for sequences with repetition.This suggests that there might have been rudimentary processing of local disruptions in the correlative structure of the stimulus sequences when they occurred strictly periodically (but not when their occurrence was jittered and unpredictable), which in turn decreased the difference between rep and no-rep sequences (i.e., the repetition effect).
While in our design regular and jittered sequences are treated as two different categories, one might argue that also within the jittered condition individual sequences differ with regard to the amount of temporal jitter they contain.The restriction that consecutive inter-pattern intervals must differ in duration by at least 50 ms served to avoid that sequences in the jittered condition were (almost) isochronous by chance, but cannot preclude that the variance in inter-pattern interval duration is larger in some "more jittered" compared to other "less jittered" sequences.For instance, some jittered sequences may have by chance contained more successive short inter-pattern intervals than others or some higher-order non-isochronous temporal regularity (e.g., an alternation between short and long inter-pattern intervals), both of which may have facilitated repetition detection in these trials.However, it should be noted that the amount of jitter was larger in the current study (up to ± 50% of the mean interval between pattern onsets) than in previous studies that had reported a substantial benefit of isochronous pattern repetition on repetition detection (up to ± 20%; [52]) or memory formation for specific white noise exemplars (± 10%; [17]) or found differences in pattern-related ERPs between isochronous and jittered sequences (up to ± 40%; [26]).Thus, it seems unlikely that the amount of jitter used in the present study explains the absence of a clear regularity benefit (such as it was demonstrated previously).
Unlike a previous study [25], we did not find evidence for a distinct pattern of attention modulation between sustained activity and phase coherence of neural oscillations.If anything, our data provide more evidence for an attention modulation of the repetition effect in ITPC than in GFP, whereas the authors of this previous study [25] reported an attention effect only for sustained activity, but not for ITPC (i.e., neural synchronisation).They proposed that the distinct susceptibility of sustained activity and neural synchronisation to the influence of attention may indicate that the two neural markers reflect dissociable processes, such that neural synchronisation is related to an early attention-independent sensory process and sustained activity to a more abstract representation of structure in sounds that requires attention [25].While this does not preclude that different EEG markers reflect functionally nuanced processes that contribute to (automatic) repetition perception, our data suggest that all of them underlie a similar modulatory influence by attention.Different weighting of putative subprocesses and their susceptibility to attention (and possibly regularity) modulation might rather arouse from subtle differences in the experimentally created listening context (e.g., specific stimulus material and distractor task).
In summary, our study replicates the results of earlier studies that showed rapid and automatic detection of pattern repetitions within continuous acoustic sequences.Crucially, pattern repetitions are processed pre-attentively even if there is no temporal regularity that could act as a cue for upcoming (predictable) pattern occurrences.This suggests that repetition perception relies on a mechanism that flexibly adapts to varying contextual demands, such as they occur in naturalistic listening situations.Yet, an attentional focus on the auditory input enhances sensory representation of repeating patterns and facilitates repetition detection.

Fig 4 .
Fig 4. Phase coherence of neural oscillations.Inter-trial phase coherence (ITPC) over frequencies and time relative to the onset of repeating patterns at position 2 to 6 within the sequence (0 ms) at a fronto-central electrode cluster for rep and no-rep sequences in each of the four Attention x Regularity conditions.Bar plots display mean ITPC between 1 and 4 Hz in a time window from 0 to 500 ms relative to pattern onset (marked by dotted lines).Error bars indicate ± 1 SEM.https://doi.org/10.1371/journal.pone.0284836.g004